Retractions of Types with Many Atoms 



Laurent Regnier 
Institut de Mathematiques de Lummy 
163 Avenue de Luminy, Case 907 
13288 Marseille Cedex 9, France 
regnier@iml . univ-mrs . f r 



Pawel Urzyczyn* 
Uniwersytet Warszawski 
Instytut Informatyki 
Banacha 2, 02-097 Warsaw, Poland 
urzyOmimuw . edu . pi 



Corrected version, September 2002 



Abstract 

We define a sound and complete proof system for affine /^-retractions in simple types built 
over many atoms, and we state simple necessary conditions for arbitrary /Sry-retractions in 
simple and polymorphic types. 



1 Introduction 



Whenever functions between different domains are of interest, it is always the case that the notion 
of a monomorphism and an isomorphism are fundamental. If we think of typed lambda calculi as 
of theories of functions between types understood as domains, we may be surprised by how little 
is understood about mono- and epi-morphisms between types. 

In the untyped lambda calculus, there are no domains, and there is a notion of an invertible 
term, corresponding to an abstract notion of an isomorphism, and the notions of right and left 
invertibility, corresponding to epi- and monomorphisms, respectively. The main results concerning 
right and/or left invertibility were obtained mostly in the 70's (see e.g. ^ |[ |9j) and an exposition 
of the theory can be found in Chapter 21 of [El. But not everything has been understood in full, 
and there is still a progress in this line of research, see || . 

The first paper, to our best knowledge, that provides a characterization of isomorphic types (simple 
types with products and an inhabited type constant), is Soloviev's work (l^] from 1981, published 
in English in 1983. Bruce and Longo have obtained a similar characterization in [[|, and they also 
introduced the notion of a type retraction. In fact, they only consider /3-retractions. It turns out 
however, that the more general case of /3?y-retractions is more difficult. 

We say that a type p is a retract of of a type r, and we write p < r, iff there exists a pair of 
terms F : p — > t and G : t — * p such that G o F =p v I, where composition G o F is understood 
as Xx.G(Fx). The subtlety in this definition is that types of free variables in F and G may be 
arbitrary, so a more adequate statement (the lack of which causes confusion among the readers 
of |3)) is "E h F : p — > r and E h G : r — > p. for a certain type environment E" . If the 

* Partly supported by KBN Grant 8 T11C 028 20. An essential part of this work was done while the second 
author was visiting IML in Marseille, thanks to CNRS funding. 
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environment E is empty (i.e., F and G are combinators) then we say that p is embedded into r, 
and we write p < r. 

If we replace /3?7-equality in the above definition by /3-equality (i.e., when G o F = | g I) we obtain 
the notion of a /3-retraction (resp. /3-embedding) , denoted by p <p r (resp. p <^ r). 

We say that types p and r are isomorphic iff there are combinators i* 1 and G, satisfying both 
G o F —f3r\ I and F o G =p v I. The definition of a /3-isomorphism is similar. 

All the above definitions are generic in that they apply to every reasonable type system. 

The notions of /3-retractability and /3-embeddability in simple types can be easily characterized as 
follows (see |, HI): 

p<pr iff r = ti — > • • • — > T fe — > p, for some r x , . . . , r fc ; 

p </3 t iff t = Ti — > • • • — > Tfc — > p, for some ri, . . . , Tfc, such that r h Ti, for all i. 

In the above, the notation r h r,; stands for provability in the intuitionistic propositional logic, or 
equivalently for the existence of a witness term M with i : t h M : t,. In particular it follows that 
two-way retractability implies equality of types. 

Thus, the case of simply typed /3-retracts and /3-embeddings is well understood. This is however 
not as easy for the general Prj-ca.se. It is not difficult to see that e.g. a — > b — > c < b — > a — > c, 
while a — > — > c ^ — > a — > c. The next two examples 

a^c<a^a^c but a — > c ^ a — > a — > c; 

a — > a < ((a — > a) — > a) — > a but a — > a ((a — > a) a) — > a, 

demonstrate, however, that the difference caused by the presence of 77 is not merely the order of 
arguments. 

The paper g by de' Liguoro, Piperno, and Statman proves results on retractability in simple types 
built over a single (inhabited) atom o. In particular, they show that retractability is equivalent to 
the existence of a surjection in every model, and that two-way retractability implies isomorphism. 
This result is obtained by a semantic approach via Statman's theorem [ |14| . Another main result 
of H is a complete proof system for "linear" , or more adequately, affine retractability, that is 
retractability by means of BCK-terms (at most one occurrence of each variable). 

Padovani, in a recent paper [jn), proves that the retractability relation is decidable, still under the 
single atom assumption. For this, he shows that if p < r then the coder-decoder pair F : p —> r 
and G : r — > p can be chosen in a certain syntactic form. A very useful generalization used by 
Padovani, which made it possible to use structural induction, was to replace the binary relation 
p < r between types by a relation S < r of simultaneous retractability of a type r onto a finite 
product, or equivalently, a multiset of types S. 

The following fact is essential for the proof method in |ll| : 

If pi — ► • • • — > p n — > o < T\ — » • • • — > t to — > o then Vi < n3j < rn(p; < Tj). (*) 

Unfortunately, this property does not hold with many atoms as can easily be seen from the following 
example (cf. Lemma 3.3(2)): 

b — > a < ((b — > a) a) — > a but 6 ^ (6 — > a) — » a. 
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Thus, the proof in JiT| does not carry over to the general case. The same example shows that with 
many atoms the axiom A4 of || is not correct, in consequence the system LPS of de' Liguoro, 
Piperno, and Statman is unsound. In this paper we show how to modify the axiom A4 to obtain 
a version of the LPS system, which is sound and complete for affine retractions with many atoms 
(Section ||). The proof of completeness is via an auxiliary system, which we call CF. At the cost of 
more complicated rules, the system CF is cut-free, and thus directly translatable into an algorithm 
to decide affine retractability. 

In the present paper we make a further step towards a solution of the still open problem of a sound 
and complete proof system for arbitrary retractions, namely we state a simple necessary condition 
for retractability. We show that pi — > • • • — > p n — > a < r implies that each pi is a retract 
of some part of r (cf. Padovani's condition (*)). It follows that under an appropriate tree-like 
representation of finite types, each labeled path in p can be expanded to a path in t. 

In particular it follows that: 

If p < t then rank (p) < rank (r), (**) 

where rank is the depth of our (non-standard) tree representation of types. More precisely, 
rank(a) = and rank(ri — ► • • • — ► r n —* a) = 1 + max{ rank{Ti) \ i = 1, . . . , k}. Condition (**) was 
first shown by Padovani JTl| ] for types over a single atom. 

A comprehensive study of type isomorphisms in typed lambda calculi is the book || of Di Cosmo. 
He gives complete proof rules for isomorphisms of simple as well as polymorphic types. However, 
there is no discussion of embeddings nor retractions for polymorphic types. 

The notion of a polymorphic embedding was first discussed in |l3| in connection with the issue of 
representability of recursive^] types in system F. In order to show that the type pa(b — > a) cannot 
be properly represented by means of beta-reductions, it is shown that there is no type r such that 
b — > t can be embedded into r. This fact follows from the following necessary condition: 

If P <p t then d(p) < d(r), 

where d{o~) is the depth of the ordinary representation of a as a finite tree. That is, d(a) = 0, 
rf(Va.cr) = d{a) and d{a — ► <j') = 1 + max(rf(cr), d(o~')). 

It is not difficult to observe that the above necessary condition does not hold for /377-embeddability 
or retractability. However, for some time it was conjectured that condition (**) above was true 



also for polymorphic types (if we additionally defined rank(\/a.a) — rank (a)). Example 5.1 gives 
a surprisingly simple counterexample to this conjecture. Instead, we can show a weaker necessary 
condition, namely 

If p < t then FV(p) C FV(t), 

Although very simple and natural, this property requires the same path computation technique 
as the result in |L3]]. In fact, we believe that a further refinement of this technique should give 
a solution of the recursive types problem. In particular, we conjecture the following: 



Conjecture 1.1 If p < r and t < p then p and t are isomorphic. 



If the above conjecture holds, then \/b(b — > r) ^ r, for all types r, and thus the recursive type 
pa\/b(b — > a) cannot be defined in System F. 

1 Note the difference between "recursive" and "inductive" . Inductive types are representable, see e.g. M, Il3| . 
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2 Preliminaries 



Types are denoted by r, u, . . . and type variables by a, 6, c, . . . Multisets of types are written as 
£, A, . . . If r = t\ — * ••• — » Tfc — ► a then we write a = head(r). If S = {t±, . . . ,Tk} and 
head{T\) = ■ ■ ■ = head(rk) — a then we also write head(T,) = a. 

The relation ~ on simple types is defined as the least equivalence relation satisfying: 

1) t <~ t' and a ~ cr' implies (r — > cr) ~ (r' — > cr'); 

2) (r ->■ cr -> p) - (cr -> r -> p). 

Types p and r are isomorphic if there are combinators _F : p — > r and G : r — > p such that both 
F o G and G o F are /Jry-equal to I. For simple types, it follows from (lj, |], |^] that p and r are 
isomorphic if and only if p ~ r. 

Types may be represented as finite trees in various ways. The most common method is to think 
of cr — > r as of a binary tree, the root of which is labeled by the arrow (or just unlabeled) and the 
two subtrees represent a and t. Of course, a variable a is seen as a single node labeled a. In case 
of a polymorphic type beginning with a sequence of quantifiers Va, this sequence of quantifiers is 
an additional label of the root node. 

Another approach is to represent a type o~\ — > o~% — > • • • — ► o~ n — > a as a tree with root labeled a 
and n direct subtrees representing o\, . . . , cr n . For example, type (a — > 6) — > (a — > c) — > d — > a is 
associated the following tree: 




6 c d 



a a 

We do not know about the origin of this representation (far less common than the "ordinary" one) . 
We learned it from Hanno Nickau @. 

This idea can be extended to the polymorphic case, provided we agree to identify types Va(r — > a) 
and r — > Vacr, whenever a is not free in r. Then a type of the form Voi-di — > Va2.(J2 —»■••—> 
Va n .tr„ — > Va„+i.a may be represented in a similar way, with all labels Vdi attached at the root. 
For instance, the type Vo.(a — » Vc(c — > c)) — > a is represented by 



Va6.a 




Vc.c 6 




a c 



It should be clear that for a simple type r the depth of the first representation of r is d(r), while the 
depth of the second representation is rank(r). We will always refer to the second representation. 
Note that, in this representation, every node corresponds to a target variable of a different part 
of r. Let tree(a) be the tree associated to a type a. For every path tt in iree(cr), let w(ir) be the 
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sequence of type variables occurring on tt, read in the bottom-up direction. 

Typed lambda terms are normally assumed to be in Church style, i.e., with full type information 
available. If a term M is of type a, we sometimes write M a to express this fact, but the superscript 
is not obligatory. 

A type p is called a retract of a type r, written p < r, if there exists a type environment E, and 
terms C, D, satisfying the following conditions: 

l)£(i:p)hC:r; 2) E h D : r -> p; 

3) DC =3,, x; 4) x is not free in D. 

The terms C and D are respectively called the coder and the decoder. The variable x is called the 
main variable of C . 

A lambda term M is called affine (or a BCK-term) iff each variable occurs in M at most once. 
Formally, each variable is an affine term, and XxM (respectively MN) are affine terms when M is 
affine (resp. when M and N are affine.) 

We write p< 1 r iff p < t holds with an affine coder and decoder. 

The relation < (and also < 1 ) is invariant with respect to type isomorphism. Thus, it may be 
useful to assume in the following that isomorphic types are simply identified. (Note that iden- 
tifying isomorphic types corresponds to ignoring the order of sons of any given node in a tree 
representation.) This suggests the following convention: For a multiset of types £ = {ri, . . . , Tfc}, 
the notation X — > p stands for any representative of the equivalence class [t\ — > • • • — > Tk — > p\~ (or 
it is just p, if S is empty). Whenever we use this notation, the properties in question are invariant 
with respect to ~. 

It is sometimes convenient to assume that coders and decoders are ^-expanded as much as possible. 
We define a notion of a fully rj-expanded term (simply typed, or polymorphic) as follows: 

1. If N a is fully //-expanded then so is (Xx:T.N) T ^ a ; 

2. If N a is fully //-expanded then so is (Aa.A^) VaCT ; 

3. If N is fully 77-expanded, b is an atom type, and for some sequence A of types and fully 
?7-expanded terms we have NA : b, then NA is fully ^-expanded. 

It should be clear that full //-expansion is preserved by beta reductions, and that each term has a 
unique full /7-expansion. A fully /^-expanded normal form is called a long normal form. 

If t = {n, . . . r n } — ► a, then we say that each t, is a part of r under the path a. Type p, which is 
a part of some Tj under a path w, is said to be a part of r under wo. A delayed argument of r is 
a part of r under a path of the form a k for an odd k. 



3 A system for affine retractions 

The following are axioms and rules of the system LPS of de' Liguoro, Piperno and Statman 
This system is sound and complete for affine retractions over a single atom a. 
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Axioms 



(Al) a < x a (A2) a < x t -» a 

(A3) o--»/j->t<I 1 /9->-<t->t (A4) (7 ((T — > a) — > a 

Rules 

(Rl) ^ J^P (R2) ^ V 



(J p (7 — > T O 



1 ■ r' 



It is not difficult to see that axiom (A4) is not adequate (see Lemma [3^(2)) unless we add the 
following restriction: 

head(cr) = a. 

Of course, now a is an arbitrary atom. Due to our convention of identifying isomorphic types, we 
may also skip axiom (A3). Thus, from now on, by system LPS we mean the system consisting of 
rules (Rl), (R2) and axioms (Al), (A2), (A4), the latter in the modified form: 

(A4) (j ((j — > aj — ► a, provided head(a) = a. 

We will show the soundness and completeness of system LPS. We find it also convenient to consider 
a "cut-free" system without the transitivity rule (Rl). Our cut-free system is called CF. It has 
only one axiom and the following rules : 

(Axiom) a < a 

(H) 



a < 1 cr', t O 1 t' 



cr — > t < cr — > T 



p O 1 T 

(N) 



(D) 



A 1 ^fl^ 1 a 1 ,...,A n ^a^ 1 o>, 



Ai U . . . U A n — > a f] 1 {Si — > (Ti — > a, . . . , S n — > <7 n — > a} — > a 



Clearly, rule (H) is he same as (R2). Also, it is not difficult to see that the three rules can be 
combined into one, namely 

pi <\ x ai, . . . ,p m < 1 CT m , Ai -» a < x (7i, . . . , A„ -> a < x a n 
{pi, . . . , p m } U Ai U . . . U A„^a < x £ U {<7i, . . . , a m } U {Si^ai^a, . . . , £„— ►ovj— >a}— >a 

Proposition 3.1 T/ie system LPS is sound for affine retractions: if a ^i 1 r is derivable then 
a < t actually holds. 

Proof: Similar to that in S. I 



Proposition 3.2 If a r is derivable in system CF then it is also derivable in system LPS. 
(In particular, CF is sound for affine retractions.) 

Proof: Induction with respect to the length of derivations. We consider only the case of rule (D). 
Let Ai — * a < x eri, . . . , A„ — > a <S X a n . First observe that 

A„ — > a < 1 ((A„ — > a) — > a) — > a (cr n — > a) — > a (S n — > <T« — > a) — > a 

The first inequality follows from (A4), the second from (R2) used twice. The last one follows from 
(A2) and (R2). Since Ai U . . . U A n — > a is the same as (Ai U . . . U A n _i) — » A„ — > a, it follows 
that 

Ai U . . . U A„ -»■ a < x (Ai U . . . U A n _i) (£„ -> cx„ -> a) -> a. 

But the latter type is isomorphic to {E n — > o~ n — > a} U Ai U . . . U A„_i — > a, and we can use 
a similar reasoning for A„_i, obtaining 

{E„-><7 n ->a} U Ai U . . . U A„_ x ^a <i ({E„^a„^a} U A x U . . . U 

After repeating this argument n times one ends up with the conclusion of the rule. I 



Lemma 3.3 

1) If D T ^ P C T =p v x p , and x FV(D) then the long normal form of D is of shape Xf T .Xz.fM . 

2) U P 53 t then head(p) = head(r). 

Proof: Part (1) is easy. Use (1) to prove (2). I 



Lemma 3.4 Let M = M°[X := xVi . . . V k ] xA x . . . A k B x ...B r , where M is an affine term. 
ThenM XB 1 ...B r . 

Proof: The proof is by induction with respect to the standard (leftmost) reduction. First suppose 
that M° has a head redex: M° = {\w.N)PQ — ► N[w := P]Q. Then M = ((\w.N)PQ)[X := 
xV\ . . . Vk] — > (N[w := P]Q)[X := xV\ . . . T4], and we can apply the induction hypothesis for 
(N[w := P]Q). 

If M° does not have a head redex, then M° must already in the form XB[ . . . B' r , where B[ — ►> Bi, 
for all i = 1, . . . , r. I 

Theorem 3.5 The system CF is complete for affine retractions: every true statement p < 1 t is 
derivable. 

Proof: The proof is by induction with respect to the size of p < x r. Let p = pi — > • ■ ■ — > pe — > a 
and let t = t\ — ► • • • — > T n — > a . If £ = 0, i.e., p = a then p <j x r is easily derived from the axiom by 
applying rule (N) as many times as needed. Let I > and assume that the coder and the decoder 
are given in long normal forms. We have C = Ay[ L . . . y„ n .C', and the decoder D must be of the 
form \f T \z Pl . . . z p £ -fS^ 1 . . . (cf. Lemma There is exactly one occurrence of each Zi 
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in D, within some Sj. We use the notation j — t(i). Note that, due to affinity, the variable / does 
not occur free in the Sj's. 



Each j = 1, . . . , n is classified as either passive or active, as follows. Take a fresh variable * of the 
same type as yj and consider the term C* = Xyl 1 . . . y^ 1 -C'[yj := *]. Reduce the application DC* 
to a head normal form, and inspect the head variable u. It should be clear that it must be either x 
or *, as otherwise u would be equal to the head variable of DC, that is to i. If u — x then we 
classify j as passive, otherwise j is active. 



Consider any i = 1, . . . , £. We now proceed by cases depending on j = t(i). 

Case 

form: 



Case 1: Assume that j is passive. Consider the standard reduction of DC* to its head normal 



DC* — » xN^ 1 . . . , 

and reduce the term DC according to the same pattern. We have a reduction of the following 
form: 

DC -»p Xzi . . . z t .(\y? ■ ■ ■ V T n-C')S^ ...S%> -»f, \ Zl ... z e .C'[y r := S r ]» =1 -»f, 
Xzi . . . ztJcM? . . . Mf 

This differs from the reduction of DC* in that we have Sj in place of *. The term Sj does not get 
substituted on the way, because there is no variable free in Sj that would be bound outside of it. 
Since z% must occur in Mi, we have that Mj = iVj[* := Sj]. (Note that this means in particular 
that t(j) ^ t(j') whenever j ^ j'.) In addition, M, must reduce to Zi- Now we can conclude that 

Pt Tj. 

Indeed, we can take Sj as the coder (with main variable Zi), and X*.Ni as the decoder. 

Case 2: Now let j be active. With no loss of generality, we may assume that t^d) = {k, ...,£} 
(a final segment of arguments). Assume that Tj = o\ — > • • • — » o v — > a. Again we "simulate" the 
head reduction of DC* and we obtain 

DC ^ Xz x ... zt.(\y?. . . y T n n .C')S? . . . SI- ^ Xz x ... z e .C'[y r := S r ]? =1 ^ 
-^fi Xz\... Zi.SjMi 1 . . . Mp p Xz\... zg.xzi ...z^ 

There is one occurrence of x in the above term. Assume for simplicity that this occurrence is 
within Mi, in a context of the form xW^ 1 . . . W[ e . We claim that 

{p k , ■ ■ -,pe} -> a < : ax. 

As the coder we take the term M[ obtained from Mi by replacing xWf 1 . . . Wg l by XW{ k . . . W% 1 , 
where X, the main variable, is fresh. The decoder is E = XgXz^ ■ ■ ■ ze.SjgM^ 2 . . . Mp p . 



We have EM[ — >p Xz k . . . zi.SjM^M^ 1 . . . M p p . By Lemma ^J, the latter term reduces to 
Xz k . . . z e .Xz k . . . z e . 

The conclusion from the two cases discussed above is the following. 

a) If j — t(i) is passive then pi <j x Tj, and the "passive part" of the function t is one to one. 

b) If j is active then {pi | t(i) = j} — > a < x a, where a is a certain argument of Tj. 

We take types up to isomorphisms, so we can freely assume that t(i) are passive for an initial 



8 



segment of {1, . . . ,£} and active otherwise. We can also assume that j = t(j) for the active j's. 
The other arguments of p can be split into the sets i _1 (j) for the active j's. From the discussion 
above it follows that p < 1 r is derived from inequalities of the form (a) and (b) with help of 
rule (U). Indeed, each type of the form S r — > ay — > a is a type of some yj, where j is active and a 
is the distinguished argument of Tj. The set £ corresponds to the j's outside the range of t I 



Recall that the rank of a type is defined by rank{a) — and rank(r\ — > • ■ ■ — > r„ — > a) = 
1 + max{ ranfc(rj) : i — 1, . . . , k}. Using Theorem |3.5| , one can show by an easy induct ion that 
p < l t implies rank(p) < rank(r). This also holds for non-affine retractions (Proposition [4.5| ). 



Other consequences of Theorem 3.5 are as follows: 
Corollary 3.6 

1. The relation < 1 is decidable (in nondeterministic polynomial time). 
2- If p <!* t and t 5] 1 p then p and r are isomorphic. 



Proof: (0) System CF gives a nondeterministic recursive algorithm to verify whether p f^ 1 t holds. 
The depth of recursive calls is linear, because each time disjoint parts of p and r are processed^] 

First we prove that there is no nontrivial retraction of a type onto itself, i.e., that each such 
retraction is actually an isomorphism. The proof is by induction with respect to the size of types. 
Clearly, there is no other coder from a to a but identity. For complex types, either rule (H) or (D) 
must be applied. In case of (H), we can immediately apply the induction hypothesis. Rule (D) 
is not applicable. Indeed, it follows from the induction hypothesis that the size of the right hand 
side must be larger than that of the left-hand side. 

Now suppose p < 1 r with coder C and decoder D. Also suppose r ^l 1 p with coder C and 
decoder D' . Let F = Xx.C and F' = Xx'.C, where x and x' are the respective main variables. 
Then (F o F')x and D 1 o D make a coder/decoder pair for r < 1 r. Thus, F o F' must be an 
isomorphism, and we have F o F' o D' o D = I. It follows that D = DoI = DoFoF'oD'oD = 
IoF'oD'oD = F'oD'oD. But the equality D = F'oD'oD implies that FoD = FoF'oD'oD = I, 
i.e., that F and D define an isomorphism between p and r. I 



4 Arbitrary retractions in finite types 

The proof rules of the previous section are no longer complete if non-affine coders and decoders 
are permitted. An example was given in Another one is as follows. 

Example 4.1 Let p=(e^a)^c^a and r = (e — > (a — » c — > a) — > a) — > a. 

Then p < r with the following coder and decoder: 

C = \y .yE e (\w^W2-x(\v e .yv(\wiW2.wi))w2)] 
D = XfXzf- a z c 2 .f(Xutu^ c ^ a .u 2 (z 1 u 1 )z 2 ), 

2 It is unclear if the nondeterminism can be eliminated. An exponential choice may be caused by rule (D). 
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a 
e 



where the variable y (used twice) has type e —>■ (a —>■ c a) ^ a, and E is anything of type e. 
But p t. Indeed, otherwise it is derivable in our system. The last rule in the derivation can't 
be (N), because r has only one argument and p ^ 1 a. Similarly, if (H) was the last rule then either 
c — > a 5] 1 a or (e — > a) — > a f^ 1 a. If the last rule was (D) then either p < : a or p < ] c -> a ^ a. 

Example 4.2 In the above example (as the one in ||), p is a retract of r, because p ~ E — > p', 
where all types in £ are retracts of arguments of r, and the final part p' is a retract of an 
argument of an argument of r (corresponding to the head variable of the coder). That this pattern 
does not make a sufficient condition can be seen when we replace r in the above example by 
t' = (e — > (c — > a) — > a) — > a. 



r ' = a 
a 

e a 
c 

Suppose that p < r'. Then the long normal form of the decoder must be: 

d = A/Azr°«2-/(Attf«r a -/(A«s«r° • ■ ■ /(^«r°^ a ) • • •)). 

for some fc > 1 and for some term A of type a, such that the head variable of A is no longer /. It 
should be clear that A must contain both Z\ and Z2 free, and that the head variable of A must not 
be free in D. But this is impossible. Indeed, in order to be of type a, the term A must either be 
of the form z\Ui or of the form ViZ2- 



Example 4.3 This (affine) example shows that the pattern mentioned in Example 4.2 



is not 



obligatory. We have p < r, where p = b —> c — > a and r = ((6 — > a) — > a) — > ((c — > a) — > a) — > a. 

P = a t = a 

be a a 



The coder and decoder are as follows: 
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C = Xyiy 2 .yi(Xw b .y 2 (Xu c .xwu)); 



D = A/AzJz^./(As b -* a .szi)(Ar c - >a .rz 2 ), 
where yi : (b — > a) — » a and 2/2 : ( c ~* a) - * a - 



Example 4.1 suggests that it may indeed be difficult to design a proof system for the relation <, as 
it is not clear if p < r can be at all expressed in terms of statements of the form p' < t', where p' 
and r' are, respectively, parts of p and r. On the positive side, we can show the following necessary 
condition. 



Theorem 4.4 Let p < r. TTien head(p) = head(r) and each argument of p is a retract of a delayed 
argument of t. 

Proof: That head(p) = head(r) = a, we already know from Lemma |Q| (2) . Assume that p = pi — > 
• • • — ► pi — > a and r = n — > r„ — > a. Also assume that the coder C = Xyi . . . y n .body(C) 

and the decoder D = XfXzi . . . zp.body(D) are given in long normal forms. 

By a segment we mean a maximal fully 77-expanded subterm of D or of C . Each segment has the 
form S = Xu.vR, where the terms R are segments. We write body(S) for vR. The variable v is the 
head variable and the u are leading variables of the segment. 

We will use a sequence of segments to represent an initial fragment of the reduction of the appli- 
cation DC. More precisely, we define a sequence Si, S2, ... of segments, called the primary spine, 
together with an assignment of meanings to the leading variables. Variables z\,...,zt make an 
exception and have no meanings. Variables occurring free in a segment, but bound in C or D, are 
assigned meanings, because they occur as leading variables earlier on the spine. 

First, take Si to be D. Then suppose Si, S2, ■ ■ ■ , Sk have been defined. Take Sk+i to be the 
meaning of the head variable of Sk (which must be part of some S q , with q < k). The meaning of 
the z-th leading variable of Sk+i is defined as the i-th argument in Sk- Note that since segments 
are fully expanded, the number of lambdas and arguments must match. Free variables of Sk+i are 
associated to the same lambda bindings as in Sq. 

The final meaning of a variable x is obtained from its meaning S by replacing free variables of S 
by their final meanings. Note that a final meaning of a variable does not contain free variables 
possessing meanings. The final meaning of a subterm M of a segment, denoted [M], is obtained 
by replacing free variables of M (whenever possible) by their final meanings. 

Now observe that the segments on a spine must alternate: odd numbered segments are parts of D, 
while the even-numbered segments are parts of C. This is equivalent to saying that a meaning of 
a variable from C is always a part of D and conversely. Indeed, the meaning of a leading variable 
is always taken from the previous segment, and we can prove this property by induction. 

Also by easy induction we can show that a free variable of any final meaning must be a free variable 
of C or of D. 

The next observation is that 

lbody(S k )] ^ p lbody(S k+ i)l 

This is because Ibody(Sk)} = [Sfc+iJA for some final meanings A. By performing reduction these 
final meanings are replacing the leading variables in body(Sk+i) resulting in [body^Sk+iJl- 

Since [6odj/(5i)] = \body{D)\ = body{D)[f := C], we have: 
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DC —»p \z\ . . . zt\body{Skj\, for all k. 



It follows from strong normalization that 

lbody(S k )j -» xzi ...z t . 

In particular, the sequence Si, S2, ■ ■ ■ must end exactly when the head variable of some Sk is x. 
Then \body{S m )\ = xZ\ . . . Zt, for some Z\, . . . , Zg. 

Let us now fix a j < £. We define the secondary spine T\,T2, ■ ■ ., beginning with T\ = Zj = 
Xvi . . .v r .body(Zj), in the same way as the primary spine, but taking into account meanings of 
variables defined before. An argument similar to the above shows that 

\Zj\ —»p Xvi . . . v r .\body{T k )\ — » \v-\_ . . . v r .ZjVi . . . v r , for all k, 

and that the secondary spine ends at some T q which has Zj as the head variable. 

The occurrence of Zj as the head variable of T q is considered essential. Further, suppose that an 
occurrence of Zj in some Tk for k < q is essential. If Tk was obtained as a meaning of a variable 
bound on the secondary spine, and this meaning was taken as a subterm of some T s , for s < k, 
then the appropriate occurrence of Zj in T s is also essential. Note that occurrences of Zj, which 
are not essential, might be replaced by fresh variables without affecting the reduction. 

Let T p be the first segment on the secondary spine, which contains an essential occurrence of Zj. 
Of course, T p is part of D, so p must be even, in particular p > 1. Thus T p is a meaning of a 
variable y which is bound in C. 

We claim the following: if y is a bound variable in C which has a meaning, then the type of y is a 
part of t. In addition, this part must be at a distance of an odd length. 

This is clear if y is a leading variable of C. Otherwise, y occurs as a leading variable of a term 
Xy.A, which is an argument of another variable y' . In order for y to have a meaning, also y' must 
have a meaning, and thus we can proceed by induction. (Note that the difference in the length of 
the distances is 2.) 

We conclude that the type r' of of T p is a part of r at a distance of an odd length. (We still need to 
know that this distance consists exclusively of a's.) We will show that pj < t' with the coder [T p ]. 

First of all, note that the variable y, of which T p is the meaning must be bound on the primary 
spine and not on the secondary one. Otherwise, T p would not be the first segment with an essential 
occurrence of zj . This also means that variables Vi . . . v r do not occur in [T p ] . Indeed, by an easy 
induction one can show that a free variable of any final meaning on the primary spine must be 
a free variable of C or of D. 

Let the term T' be obtained from body(T p —i) by replacing the head variable y by a fresh variable Y. 
Define T" = XYXvi . . . w r .[[T'J|, and let T be T 1 with all occurrences of Zj replaced by fresh z'j. 
These occurrences of zj are not essential, and as we have observed, they have no influence on the 
reduction. Thus, T|T p ] reduces to Xvi . . . v r .ZjVi . . . v r , i.e., the term T is a correct decoder. 

It remains to prove that r' is a delayed argument of r. First observe that the target variable of 
types of all segments at the primary spine must be a, because this must be the same variable as 
the target of p, the type of x. Let y be the variable of which T p is the meaning. We have already 
observed that it is a leading variable of some Si on the primary spine. If this Si is the meaning 
of some y', then the target of the type of y' is a, and y' must be bound on the primary spine too. 
Thus, t' is a part of the type of y' at the distance a. Note that y' is a variable bound in D. Moving 
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this way upward the primary spine, we see that r' must be a part of the type r of / at a distance 
built only from a's. I 



For two words, 

some w',w". The symbol C stands for the transitive and reflexive closure of C°. The following is 



an easy consequence of Theorem 4.4. 
Proposition 4.5 

1- If P 53 t then rank(p) < rank(r). 

2- If p <! t and it is a path in p then there is a path tt' in r 10(71") E w(jr'). 



Unfortunately, Theorem 4.4 does not suffice as a basis for a decision procedure for the relation <, 
and the decidability of < remains an open problem. However, the other part of Corollary 3.6 is 



true, even if we can not give a direct syntactic proof. Indeed, we can modify the semantic proof 
given in H for the single atom case. 



Proposition 4.6 If p <\t and r < p then r and p are isomorphic. 



Proof: It is shown in || (Proposition 2.4) that this property holds for types over a single atom. 
Let p° (resp. r°) be obtained from p (resp. r) by replacing all type variables by a single type 
constant o. Of course p < r and r < p implies that p° < r° and r° <j p° . Proposition 2.4 in |s| 
states that p° and t° must be isomorphic. It is of course not enough, but what is actually shown 
is that in this case an arbitrary retraction is an isomorphism. Let C and D be respectively the 
coder and the decoder for p < r. Then, of course, C and D work also for p° < r°, and thus 
(Xx.C) o D —p v I. This property does not depend on types assigned to C and D, and thus r and p 
are isomorphic too. I 



5 The polymorphic case 

Example 5.1 Let £ = ((g — * /) — * e) — > d. Take p = ((£ — > £ — » c) — > b) — > a, and r = Va(((a 
a — > c) — > 6) — > (a — > £) — > a). (See the picture.) 



Then both the rank (depth) of p and its number of nodes are greater than of r. But we have p < r 
with: 

C = Aa\yz.x(\x2-y(\vu.X2(zv)(zu))), 
D = \f\ Xl .fZxxI. 

The above example shows that the properties of < do not carry over from the simply typed to the 
polymorphic case. We can still however give the following necessary condition for for polymorphic 
retractability: a < r implies FV{a) C FV(t). The remainder of this section is devoted to a sketch 
of proof of this property. 

If we assume that coders and decoders are always fully ry-expanded, then a reduction DC —^pn x 
can be split as DC X —» v x, where X is the full ^-expansion of x. (This is because no r\ 
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redex can block a (5 reduction step.) Thus, from now on, we think of X rather than of x, as of the 
final stage of the reduction, and we only need to talk about /3-reductions. 

Let us have a closer look at a full ^-expansion X of a variable x : p. The only free variable of X 
is x and we may assume that all bound variables are chosen different. It is not difficult to see that 
there is a bijective correspondence between variables in X and nodes in the tree representation 
of p. The variable x corresponds to the root, and then we have: X — AaXxi.xaXi, where X\ are 
full /^-expansions of x\. Each of variables in x\ corresponds to a son of the root, and then we apply 
induction. Write node(y) for the node corresponding to y. Note that if y : p' occurs in X then the 
target variable of p' is the label of node(y). The rank of y is defined as the depth of node(y), and 
if node(y) is a son (father) of node(z), then we also refer to y as a son (father) of z. 

From now on assume that x : p \- C : t, \- D : t ^ p and DC —»p X, where X is the full 
77-expansion of x. Of course C and D are fully ^-expanded. Identify the variables of X with their 
ancestors in DC. 

Lemma 5.2 (The ancestors of) all variables of X of even depth occur in C, while (the ancestors 
of) all variables of odd depth occur in D. 

Proof: Of course the ancestor of x occurs in C, and it is not difficult to see that D = AaXfXxi .... 
where x*\ are all the (ancestors of) variables of depth 1. Then we can proceed by induction. 
A variable y of depth i must occur in X in a context of the form: . . . Af3Xy\.y[3Yi, where Y\ are 
full ^-expansions of y\. If some y\ £ y\ is of type of depth greater than zero, then the corrsponding 
member of Y\ is Y\ = A^Ay^-yi^- Suppose the ancestor of y occurs in C. It must occur in 
a subtcrm of the form ySY{ which is eventually turned into y(3Y\ by a series of substitution. Since 
Y[ are fully ?y-expanded, they must begin with AjX^.yi ■ ■ ., and thus all variables of level i + 2 
must also occur in C. I 

We represent lambda terms as graphs, and we consider computations along these paths, as in p3[ . 
We refer the reader to the latter paper for detailed definitions. 

Proposition 5.3 If p<r then FV(p) C FV(t). 

Proof: Let a £ FV(p). If a is the target of p, then it must also be the target of r, because the 
decoder must have the form AaXf.f . . . 



14 



Otherwise, let y be a variable in X , of level i + 1 > 0, which corresponds to an occurrence of a, 
and let z be the father of y. There is a path II in X from y to z and a computation along II, 
beginning at the configuration (y,(q k )) and ending at (z, (q e pq k )). This path II is obtained as 
a result of a sequence of deformations from a path IIo in DC, also from y to z. Using a reasoning 
almost identical to that of we can show that there is a correct computation along IIo, an d 
that this computation is "good", i.e., every compression begins a V-cycle ending at the same node 
with a matching decompression. Indeed, the proof of the crucial Lemma 11, remains essentially the 
same. The only difference is as follows: the reason we must leave a term of the form (Aa.Af /aT a 
entered via its top node with a compression is not because we must eventually visit the top node 
of the whole term. Instead, we must reach z, with a push-down store pointing to a free occurrence 
of a. However, within the term Aa.A our wires are "below" occurrences of a in all the types they 
cross. 

By Lemma [5.2| , the path IIo either begins within C and ends within D or conversely. In each case, 
we must cross the type r of C. But if we do it in a compressed configuration, we must necessarily 
return back. Thus, at least once, we do it without any compression, and this means that a must 
be free in r. I 



Proposition 5.3 is of course useless when e.g. p < r and p is a closed type. We conjecture that there 



is always an injection from bound variables of p to bound variables of r. However, a generalization 



of Proposition 5.3 to handle bound variables will require a finer analysis. 



Indeed, in the (3rj case, it is in general not true that every compression must have a matching 
decompression. Consider, for instance a variable x : Va(a — > a), and its full 77-expansion X = 
Aa\y a .xay. Then the path from y to x in X gets "compressed" at the type application xa and is 
never "decompressed" . This happens because a is a bound variable. 



The analogue of Proposition 4.6 for polymorphic types is only conjectured. 



Conjecture 5.4 If p < t and r < p then p and r are isomorphic. 

A possible way to prove this conjecture would be to show first that every self-retraction is an 
isomorphism: 

Conjecture 5.5 If t <t with coder C and decoder D then (Xx.C) o D =p n I. 



Proposition 5.6 Conjecture 5.5 implies Conjecture 5.4 



Proof: Same as the proof of Corollary ^(||) I 
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